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, Abstract. In Artificial Intelligence with Coalition Structure Generation (CSC) 

one refers to those cooperative complex problems that require to find an opti- 
mal partition, maximising a social welfare, of a set of entities involved in a sys- 
tem into exhaustive and disjoint coalitions. The solution of the CSG problem 
finds applications in many fields such as Machine Learning (covering machines, 
clustering). Data Mining (decision tree, discretization). Graph Theory, Natu- 
ral Language Processing (aggregation). Semantic Web (service composition), 
and Bioinformatics. The problem of finding the optimal coalition structure 
is NP-complcte. In this paper we present a greedy adaptive search procedure 
■ (GRASP) with path-relinking to efficiently search the space of coalition struc- 

, ' tures. Experiments and comparisons to other algorithms prove the validity of 

' the proposed method in solving this hard combinatorial problem. 

(N 
> 

1. Introduction 

An active area of research in Artificial Intelligence regards methods and algo- 
rithms to solve complex problems that require to find an optimal partition (max- 
, imising a social welfare) of a set of entities involved in a system into exhaustive and 

' disjoint coalitions. This problem has been studied a lot in the area of multi-agent 

systems (MASs) where it is named coalition structure generation (CSG) problem 
(equivalent to the complete set partitioning problem) . In particular it is interesting 
to find coalition structures maximizing the sum of the values of the coalitions, that 
represent the maximum payoff the agents belonging to the coalition can jointly re- 
ceive by cooperating. A coalition structure is defined as a partition of the agents 
' involved in a system into disjoint coalitions. The problem of finding the optimal 

coalition structure is TV'P-complete [HI [25] . 

Coalition generation shares a similar structure with a number of common prob- 
lems in theoretical computer science and artificial intelligence, such as in combi- 
natorial auctions; in job shop scheduling, Machine Learning, Data Mining, Graph 
Theory, Natural Language Processing, Semantic Web, and in Bioinformatics. In 
this paper we will use the term coalition structure generation as a general term to 
refer to all these grouping problems. 

Sometimes there is a time limit for finding a solution, the agents must be reactive 
and they should act as fast as possible. Hence for the specific task of CSG it is 
necessary to have approximation algorithms able to quickly find solutions that are 
within a specific factor of an optimal solution. The goal of this paper is to propose 
a new algorithm for the CSG problem able to quickly find a near optimal solution. 

The problem of CSG has been studied in the context of characteristic function 
games (CFGs) in which the value of each coalition is given by a characteristic 
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function, and the values of a coalition structure are obtained by summing the value 
of the contained coalitions. The problem of coalition structure generation is NV- 
hard, indeed as proved in }25j . given n the number of agents, the number of possible 
coalition structures than can be generated is 0{n"-) and Lo{n"'/'^). Moreover, in 
order to establish any bound from the optimal, any algorithm must search at least 
2" — 1 coalition structures. The CSG process can be viewed as being composed of 
three activities |25] : a) coalition structure generation^ corresponding to the process 
of generating coalitions such that agents within each coalition coordinate their 
activities, but agents do not coordinate between coalitions. This means partitioning 
the set of agents into exhaustive and disjoint coalitions. This partition is called a 
coalition structure (CS); b) optimization: solving the optimization problem of each 
coalition. This means pooling the tasks and resources of the agents in the coalition, 
and solving this joint problem; and c) payoff distribution: dividing the value of the 
generated solution among agents. Even if these activities are independent of each 
other, they have some interactions. For example, the coalition that an agent wants 
to join depends on the portion of the value that the agent would be allocated in 
each potential coalition. This paper focuses on the coalition structure generation in 
settings where there are too many coalition structures to enumerate and evaluate 
due to costly or bounded computation and limited time. Instead, agents have to 
select a subset of coalition structures on which to focus their search. 

In this paper we extend the work presented in [4| by adopting a stochastic local 
search (SLS) procedure [TT], named GRASP [5] improved with path-relinking jlOj . 
to solve the problem of coalition structure generation in CFGs. The main advantage 
of using a stochastic local search is to avoid exploring an exponential number of 
coalition structures providing a near optimal solution. Our algorithm does not 
provide guarantees about finding the global optimal solution. In particular the 
questions we would like to pose are: Ql) can the mctaheuristic GRASP with path- 
relinking be used as a valuable anytime solution for the CSG problem? In many 
cases, as in CSG, it is necessary to terminate the algorithm prior to completion due 
to time limits and to reactivity requirements. In this situation, it is possible to adopt 
anytime algorithms (i.e. algorithms that may be terminated prior to completion and 
returning an approximation of the correct answer) whose quality depends on the 
amount of computation; Q2) can the mctaheuristic GRASP with path-relinking be 
adopted for the CSG problem to find optimal solution faster than the state of the art 
exact algorithms for CSG problem? In case of optimization combinatorial problems, 
stochastic local search algorithms have been proved to be very efficient in finding 
near optimal solution [llj . In many cases, they outperformed the deterministic 
algorithms in computing the optimal solution. 

The paper is organized as follows: Section[5]introduces some applications sharing 
a common structure with the CSG problem. Section [3] presents basic concepts 
regarding the CSG problem, and Section |4] reports the related works about the 
problem. In Sections [5] and |6] the mctaheuristic GRASP and its exension with 
path-relinking applied to the CSG problem will be presented. Section [7] shows 
some implementation details and Section [5] will conclude the paper. 
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2. Applications 

In this section we report some eommon grouping problems in theoretical com- 
puter science and artificial intelligence that share a similar structure with the CSG 
problem. 

2.1. Discretization of attributes. An important problem in knowledge discovery 
and data mining is the discretization of attributes with real values [S] [3]. The 
discretization process facilitates the extraction of decision rules from a table with 
real value attributes. As presented in [3] the discretization selects a set of cut points 
of attributes determining a partition of the real value attributes into intervals. The 
set of cuts determines a grid in fc-dimensional space with Y[i=i '^i regions, where k 
is the number of attributes and Ui is the number of intervals of the z-th attribute. 
Checking if there is a consistent set of cuts such that the grid defined by them 
contains at most K regions in NP-complete [3]. 

2.2. Learning Bayesian prototype trees. In [20] the authors present a method 
to learn Bayesian networks, and specifically Bayesian prototype trees assuming that 
data form clusters of similar vectors. The data can be partitioned into clusters and 
the maximum likelihood estimates of the partitioning computed. A data parti- 
tion determines the corresponding Bayesian prototype tree model. Hence, given a 
training data set V consisting of N data vectors, the authors proposed to find the 
optimal Bayesian prototype tree by finding the optimal partition vector among the 

different vectors. 

2.3. Cluster ensemble problem with graph partitioning. Clustering and 
graph partitioning are two concepts strongly related. Clustering is a data analysis 
technique adopted in statistics, data mining, and machine learning communities, 
involving partitioning a set of instances into a given number of groups optimising 
an objective function. Recently, cluster ensemble techniques [27| improve cluster- 
ing performance by generating multiple partitions of the given data set and then 
combining them to form a superior clustering solution. \27\ [7] propose a graph 
partitioning formulation for cluster ensembles. Given a data set x = {Xi, . . . , A„}, 
a cluster ensemble is a set of clustering solutions, represented as C = {Ci, . . . , Cr}, 
where R is the ensemble size. Each clustering solution Ci is a partition of the 
data set into Ki disjoint clusters. The graph partitioning problem is to partition a 
weighted graph G into K parts by finding K disjoint clusters of its vertices. G is 
characterised by the set V of vertices and by a nonnegative and symmetric similar- 
ity matrix W characterising the similarity between each pair of vertices. The cut 
of a partition P is defined as Cut{P, W) = ^ w{i,j), where i and j are vertices do 
not belonging to the same cluster. The general goal of graph partitioning is to find 
a partition that maximise the cut. 

2.4. Aggregation for Natural Language Generation. Aggregation represents 
a main component of natural language generation systems. The task is to merge 
two or more linguistic structures into a single sentence. In [5] the authors presented 
an automatic tool for performing the semantic grouping task by formalising it as 
a CSG problem, where each coalition corresponds to a sentence. The strength of 
the proposed approach lies in its ability to capture global partitioning constraints 
by performing collective inference over local pairwise assignments. Pairwise con- 
straints capture the semantic compatibility between pairs of linguistic structures 
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at local level. The global task is to search a semantic grouping that maximally 
agrees with the pairwise preferences while simultaneously satisfies constraints on 
the partitioning as a whole. 

2.5. Privacy-preserving data mining. Privacy-preserving data mining [55] is a 
new research area that is focused on preventing privacy violations that might arise 
during data mining operations. To reach this goal many techniques modify original 
datascts in order to preserve privacy even after the mining process is activated 
ensuring minimal data loss and obtaining qualitative data mining results. In |17j the 
authors presented a technique considering anonymisations for classification through 
feature set partitioning. Feature set partitioning decomposes the original set of 
features into many subsets in order to induce a classifier for each subset. Then 
unlabclled instances are classified combining the votes of all classifiers. The work 
in [17] solves the problem of preserving k-anonymity via feature set partitioning as 
follows. Given a learner /, a combination method C, and a training set S with input 
feature set A, the goal is to find an optimal partitioning of the input feature set A 
into w mutually exclusive subsets. Optimality is defined in terms of minimisation 
of the generalization error of the classifier combined using the method C. 



In this section we present the basic notions about the CSG problem. 

Definition 1 (Coalition). Given a set N = {01,02, . . . ,o„} of n agents, \N\ = n, 
called the grand coalition, a coalition S is a non-empty subset of the set N , ^ ^ 
S<ZN. 

Definition 2 (Coalition Structure). A coalition structure (CS), or collection, C = 
{Ci, C2, . . . , Cfc} C 2^ is a partition of the set N, and k is its size, i.e. Vijj ■ 
C, n = and uf^iC = N. 

Given C = {Ci, C2, . . . , C^}, we define UC = U*LiC,;. We wih denote the set of 
all coalition structures of N as M{N). 

As reported in [1], assuming a comparison relation t>, At>B means that the way 
A partitions N , where N = = Ui3, is preferable to the way B partitions K . 

In this paper we consider the following rules that allow us to trasform partitions 
of the grand coalition. Given a CS C = {Ci, C2, . . . , Ct}: 

SPLIT:: C ^ C \ {CJ U {Ck, Ch], where U C;, = C,, with Ck, Ch + 0; 
MERGE:: C {C„ Cj],^j U {Cfc}, where Cu = C, U Q; 

SHIFT:: C C \ {C^,C-},^j U {C[,C]}, where C[ = C,\ {oj and Cj = 
Cj U {oi}, with Oi aCi. 
As in common practice |25[ 121] . we consider coalition structure generation in 
characteristic function games (CFGs). A CFG is a pair (TV, v), where N = {oi, 02, . . 
and V is a. function v : 2^ M.. Given C a coalition structure, v{C) = X^c ec ^(^Oj 
where v{Ci) is the value of the coalition Ci. Intuitively, v{Ci) represents the maxi- 
mum payoff the members of Ci can jointly receive by cooperating. As in [25j . we as- 
sume that v{Ci) > 0. In case of negative values, it is possible to normalize the coali- 
tion values, obtaining a game strategically equivalent to the original game [l^, by 
subtracting a lower bound value from all coalition values. For CFGs the comparison 
relation on coalition structures is induced in a canonical way, At>B <^=^ v{A)>v(B). 
It is possible to prove the following results: 
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Lemma 1 (|13]). Given two coalition structures A^B ^ M{N), with B, then 
A can be transformed into B by doing at most n — 1 applications of the SPLIT or 
the MERGE rules. 

Lemma 2 (|13j). Given two coalition structures A,BCz M{N), with A ^ B, then 
A can be transformed into B by doing at most n — 1 applications of the SHIFT rule. 

Now we can define the coalition structure problem as follows. 

Definition 3 (Coalition structure generation problem). Given a set of agents A, 
the coalition structure generation problem is to maximize the social welfare of the 
agents by finding a coalition structure C* = argmaxcg^v^^^) ^^(C)- 

Formally, a CSG problem may be formulated as a set partitioning problems 
(SPP). Let / = {!,..., m} be a set of objects, and let {Pi, . . . , P„} be a collection 
of subsets of /, with a cost Cj G R+ associated with each subset Pj. Given a n x m 
binary matrix A = {a,,,}, where aij = 1 if i inPj and a,;j = otherwise, let J be a 
solution of SPP represented as the n-dimensional vector .t = (x^, . . . , a;„) of binary 
decision variables. An integer programming formulation of the set partitioning 
problem is 

n 

z{x) = max^^ CjXj 

n 

subject to OijXj = 1, i = 1 . . . m. 

Given n agents, the size of the input to a CSG algorithm is exponential, since 
it contains the values v{-) associated to each of the (2" — 1) possible coalitions. 
Furthermore, the number of coalition structures grows as the number of agents 
increases and corresponds to ■^("■i where Zin, i), also known as the Stirling 

number of the second kind, is the number of coalition structures with i coalitions, 
and may be computed using the following recurrence: Z{n, i) = iZ{n — 1, i) + Z{n — 
1,1 — 1), where Z(n,n) = Z(n,l) = 1- As proved in [25], the number of coalition 
structures is 0{n") and aj(ri"/^), and hence an exhaustive enumeration becomes 
prohibitive. 

In this paper we focus on games that are neither superadditive nor subadditive 
for which the problem of coalition structure generation is computationally complex. 
Indeed, for superadditive games where v{S U T) > v{S) + v{T) (meaning any two 
disjoint coalitions are better off by merging together), and for subadditive games 
where v{S U T) < v{S) + v{T) for all disjoint coalitions S,T C A, the problem of 
coalition structure generation is trivial. In particular, in superadditive games, the 
agents are better off forming the grand coalition where all agents operate together 
(C* = {A}), while in subadditive games, the agents are better off by operating 
alone (C* = {{ai}, {02}, . . . , {a„}}). 

Instances of the CSG problem have been defined using the following distributions, 
as proposed in [TBI [22], for the values of the characteristic function v: 

• Uniform (U): v{C) ~ U{a, b) where a = and 6 = 1; 

• Uniform Scaled (US): w(C) - |C| • [/(a, 6) where a = and 6 = 1; 

• Normal (N): v{C) ~ N{fi,a^) where ^ = 1 and a = 0.1; 

• Normal Scaled (NS): v{C) ~ |C| • N{ii,a^) where ^ = 1 and a = 0.1; 

• Normally Distributed (ND): v{C) - N{n,a^) where = \C\ and ct = y/\C\. 
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Figure [T] plots the coalition structures' values according to the previous five 
distributions of the characteristic function for 10 agents. Each graph plots on 
the X-axis the value of the coalition structures whose cardinality is represented 
by a point on the y-axis. As we can see, it seems to be easy to find optimal 
solutions in the case of normal and uniform distributions, while it becomes more 
complicated for the case of scaled distributions. For the normal distribution, the 
optimal solution belongs to the less populated region corresponding to CSs with 
none or ten coalitions. The same scenario arises for the uniform distribution, even 
if here the region containing the optimal solution is more populated than in the 
previous case. The CFGs with scaled distributions are very hard to solve since 
the optimal solution may belong to very populated regions. More formally, given 
n agents, let c be the random variable of the value of a CS with k coalitions 
Ci. For each distribution the expected value of the c variable may be computed 
as follows: Ei7(c) = '}2^^{'^i) = ^/^ (maximum when k = n, many coalitions); 
E(7s(c) = X]f (maximum with few coalitions where it is more probable to 

assign a high value to each one); Ejv(c) = 1 = fc (maximum when k = n); 
IE ATS (c) = \Ci\ (maximum with few coalitions); E]yj){c) = \Ci\ (maximum 
with few coalitions). 



4. Related Work 

Previous works on CSG can be broadly divided into two main categories: exact 
algorithms that return an optimal solution, and approximate algorithms that find 
an approximate solution with limited resources. 

A deterministic algorithm must systematically explore the search space of can- 
didate solutions. One of the first algorithms returning an optimal solution is the 
dynamic programming algorithm (DP) proposed in |29] for the set partitioning 
problem. This algorithm is polynomial in the size of the input (2" — 1) and it runs 
in 0(3") time, which is significantly less than an exhaustive enumeration (0(n")). 
However, DP is not an anytime algorithm, and has a large memory requirement. 
Indeed, for each coalition C it computes the tables ti(C) and t2{C). It computes all 
the possible splits of the coalition C and assigns to ii(C) the best split and to t2{C) 
its value. In |21) the authors proposed an improved version of the DP algorithm 
(IDP) performing fewer operations and requiring less memory than DP. IDP, as 
shown by the authors, is considered one of the fastest available exact algorithm in 
the literature computing an optimal solution. 

Given a coalition C, |C| = rt, the number of splitting of C into two coalitions Ci 
and C2, with |Ci| = si and IC2I = S2 is computed as follows 



5(51,52) 



C(si -I- S2,S2)/2 if Si = S2 

C(si -|- §2, S2) otherwise 



where C(n, k) = (^,) is the binomial coefficient, i.e. the number of fc-combinations 
from a set with n elements. Now, the total number of splittings computed by DP 
is 

n s — 1 

Si)p = C(7i, s) S{s — k,k) 

s=l k=\s/2] 
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(a) Uniform 



(b) Uniform scaled 
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(c) Normal 



(d) Normal scaled 




(e) Normally distributed 

Figure 1 . Plots of the coalition structures' values according to the 
Uniform, Uniform scaled, Normal, Normal scaled, and Normally 
Distributed distributions of the characteristic function for 10 
agents. 

while those computed by IDP is 

n I s—1 

S/_DP = ^ C(n, s) ^ S{S - k,k)l{k<n-sys=n} 



where l{fc<n-svs=ra} is 1 if fc < « — s or s = n, otherwise. 
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Figure 2. Coalition structure graph for a 4-agent game. 



Both DP and IDP are not anytime algorithms, they cannot be interrupted before 
their normal termination. In |25j , Sandholm et al. have presented the first anytime 
algorithm, sketched in Algorithm [Tl that can be interrupted to obtain a solution 
within a time limit but not guaranteed to be optimal. When not interrupted it 
returns the optimal solution. The CSG process can be viewed as a search in a 
coalition structure graph as reported in Figure [2] One desideratum is to be able to 
guarantee that the CS is within a worst case bound from optimal, i.e. that searching 
through a subset N of coalition structures, k = min{fc'} where k' > y^^g, is finite, 
and as small as possible, where S* is the best CS and 5^ is the best CS that has 
been seen in the subset N. In [25] has been proved that: a) to bound k, it suffices 
to search the lowest two levels of the coalition structure graph (with this search, the 
bound fc = 71, and the number of nodes searched is 2"~^); b) this bound is tight; 
and, c) no other search algorithm can establish any bound k while searching only 
2"^^ nodes or fewer. 



Algorithm 1 Sandholm et al. algorithm 

(1) Search the bottom two levels of the coalition structures graph. 

(2) Continue with a breadth-first search from the top of the graph as long as 
there is time left, or until the entire graph has been searched (this occurs 
when this breadth-first search completes level 3 of the graph, i.e. depth 
n-3). 

(3) Return the coalition structure that has the highest welfare among those 
seen so far. 



A new anytime algorithm has been proposed in |22| , named IP, whose idea is to 
partition the space of the possible solutions into sub-spaces such that it is possible 
to compute upper and lower bounds on the values of the best CSs they contain. 
Then, these bounds are used to prune all the sub-spaces that cannot contain the 
optimal solution. Finally, the algorithm searches through the remaining sub-spaces 
adopting a branch-and-bound technique avoiding to examine all the solutions within 
the searched sub-spaces. IP can be used to find optimal coalition structure avoiding 
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to search most of the search space. As reported in [22], IP finds optimal solutions 
much faster than any previous algorithm designed for this purpose. 

As regards the approximate algorithms, in |26| it has been proposed a solution 
based on a genetic algorithm, which performs well when there is some regularity 
in the search space. Indeed, the authors assume, in order to apply their algorithm, 
that the value of a coalition is dependent of other coalitions in the CS, making the 
algorithm not well suited for the general case. A new solution [13] is based on a 
Simulated Annealing algorithm [T3], a widely used stochastic local search method. 
At each iteration the algorithm selects a random neighbour solution s' of a CS s. 
The search proceeds with an adjacent CS s' of the original CS s if s' yields a better 
social welfare than s. Otherwise, the search is continued with s' with probability 
g(y(s )-v{s))/t^ where t is the temperature parameter that decreases according to 
the annealing schedule t = at. 

5. GRASP FOR THE CSG problem 

The resource limits posed by some intelligent systems, such as the time for finding 
a solution, require to have approximation algorithms able to quickly find solutions 
that are within a specific factor of an optimal solution. In this section we firstly 
present the anytime algorithm for CSG proposed in [3] with some improvements 
and then its extension with path-relinking. 

A method to find high-quality solutions for a combinatorial problem is a two steps 
approach consisting of a greedy construction phase followed by a perturbativ^^ lo- 
cal search [11]. Namely, the greedy construction method starts the process from 
an empty candidate solution and at each construction step adds the best ranked 
component according to a heuristic selection function. Successively, a perturbative 
local search algorithm is used to improve the candidate solution thus obtained. Ad- 
vantages of this search method, over other stochastic local search algorithms, are 
the much better solution quality and fewer perturbative improvement steps to reach 
the local optimum. Greedy Randomized Adaptive Search Procedures (GRASP) [6] 
solve the problem of the limited number of different candidate solutions generated 
by a greedy construction search methods by randomising the construction method. 
GRASP is an iterative process, in which each iteration consists of a construction 
phase, producing a feasible solution, and a local search phase, finding a local opti- 
mum in the neighborhood of the constructed solution. The best overall solution is 
returned. 

Algorithm [2] reports the outline of the GRASP procedure for the CSG problem, 
denoted in the following with GRASP. In each iteration, it computes a solution C by 
using a randomised constructive search procedure and then applies a local search 
procedure to C yielding an improved solution. The main procedure is made up of 
two components: a constructive phase (lines 7-13) and a local search phase (line 
15). The constructive search algorithm used in GRASP iteratively adds a solution 
component by randomly selecting it, according to a uniform distribution, from a 
set, named restricted candidate list (RCL), of highly ranked solution components 
with respect to a greedy function 5 : C R. The probabilistic component of 
GRASP is characterized by randomly choosing one of the best candidates in the 
RCL. In our case the greedy function g corresponds to the characteristic function v 



A perturbative local search changes candidate solutions by modifying one or more of the 
corresponding solution components. 
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Algorithm 2 GRASP CSG 

Require: v: the characteristic function; 
A: the set of n agents; 
maxlter: maximum number of iterations; 
neighOp: neiglibourhood operator; 

riiSteps: max non improving search steps for the RII procedure; 
wp: RII walk probability 
Ensure: solution C £ M{A) 
1: (7 = 0, v{C) = -oo 
2: iter = 

3: while iter < maxlter do 

4: a = rand(0,l); 

5: C = 0; i = 

6: /* construction */ 

7: while i < n do 

8: S = {C'\C' = add{C,A)} 

9: s = max{z;(r)|T e C} 
10: s = min{i;(r)|r e C} 
11: RCL = {C' €S\v{C')>s + a{s-s)} 
12: randomly select an element C from RCL 
13: i ^ i + 1 

14: /* local search */ 

15: C = RandomisedIterativeImprovement(C, wp, riiSteps, neighOP) 

16: if v{C) > v{C) then 

17: C^C 

18: iter = iter + 1 

19: return C 



presented in Section[3l In particular, given w, the heuristic function, and C, the set 
of feasible solution components, s = min{-y(C)|C £ C} and s = max{?;(C)|C G C} 
are computed. Then the RCL is defined by including in it all the components C 
such that v{C) > s + a(s — s). The parameter a controls the amounts of greediness 
and randomness. A value a = 1 corresponds to a greedy construction procedure, 
while a = produces a random construction. As reported in [19], GRASP with 
a fixed nonzero RCL parameter a is not asymptotically convergent to a global 
optimum. The solution to make the algorithm asymptotically globally convergent, 
could be to randomly select the parameter value from the continuous interval [0, 1] 
at the beginning of each iteration and using this value during the entire iteration, 
as we implemented in GRASP. 

Given a set of nonempty subsets of n agents A, C — {Ci, C2, . . . , Ct}, such that 
Ci n Cj ^ ^ and UC C A, the function add{C, A) used in the construction phase 
returns a refinement C obtained from C using one of the following operators: 

(1) C" ^ C \ {Ci} U {Ci} where C; = C, U {a,} and a; ^ UC, or 

(2) C ^CU {Ci} where C, = {aj and Oi ^ UC. 

Starting from the empty set, in the first iteration all the coalitions containing exactly 
one agent are considered and the best is selected for further specialization. At the 
iteration i, the working set of coalition C is refined by trying to add an agent to 
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one of the coalitions in C or a new coalition containing the new agent is added to 
C. 

5.1. Local search: Randomised Iterative Improvement. To improve the so- 
lution generated by the construction phase, a local search is used. It works by 
iteratively replacing the current solution with a better solution taken from the 
neighborhood of the current solution while there is a better solution in the neigh- 
borhood. In order to build the neighborhood of a coalition structure C we adopted 
the previously reported operators SPLIT, MERGE and SHIFT, leading to the fol- 
lowing two neighbourhood relations: 

• ^^s/m(C) = {C e S\s' e SPLIT{s) U MERGE{s)} 

• Ns{C) = {C e S\s' e SHIFT{s)} 

In particular, as a local search procedure in GRASP we used a Randomised Itera- 
tive Improvement (RII) technique [TT], as reported in Algorithm |31 The algorithm 
starts from the solutions obtained in the constructive phase of GRASP, and then 
tries to improve the current candidate solution with respect to v. RII uses a pa- 
rameter wp e [0,1], called walk probability, that corresponds to the probability 
of performing a random walk step (line 12) instead of an improvement step (lines 
14-23). The uninformed random walk randomly selects a solution from the com- 
plete neighbourhood (line 12). The improvement step randomly selects one of the 
strictly improving neighbours I{C) (line 16) or a minimally worsening neighbour if 
the set 1(C) is empty (line 18). The search is terminated when a given number of 
search steps (steps) has been performed without achieving any improvement (line 
4). 

5.2. GRASP evaluation. Stochastic Local Search algorithms are typically in- 
complete when applied to a given instance of an optimisation problem and the 
time required for finding a solution may be considered as a random variable |llj . 
Given an optimisation SLS algorithm S for an optimisation problem 11 and a sol- 
uble instance tt G 11, let P{Ts,n < t, Qs,-k < (?) denote the probability that S 
applied to tt finds a solution of quality less than or equal to q in time less than 
or equal to t. The run-time distribution (RTD) of S on the specific instance tt 
is the probability distribution of the bivariate random variable (Ts.tti Qs,7r)j char- 
acterised by the run-time distribution function rtd : M+ x E+ — 5- [0, 1] defined as 
rtd{t,q) = P{Ts.Tr < t,Qs,Tr < q) [H]. To empirically measure RTDs, let k be 
the total number of runs performed with a cutoff time t', and let k' < k he the 
number of successful runs (i.e., runs during which a solution was found). Let rt{j) 
denote the run-time for the jth entry in the list of successful runs, ordered ac- 
cording to increasing run-times. The cumulative empirical RTD is then defined by 

PiT<t) #{j|rt(j) < OA- 

A CPU time measurement is always based on specific implementations and run- 
time environments. It is often more appropriate to measure run-time in a way that 
allows one to abstract from these factors. This can be done using operation counts, 
reflecting the number of operations that are considered to contribute significantly 
towards an algorithm's performance. Run-time measurements corresponding to 
actual CPU times and abstract run-times measured in operation counts may be 
distinguished by referring to the latter as run-lengths. We refer to RTDs obtained 
from run-times measured in terms of operation counts as run-length distributions 
or RLDs [H]- 
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Algorithm 3 Randomised Iterative Improvement (RII) 

Require: C: candidate solution; 

neighOP: neighbourhood operator; 
steps: max non improving search steps; 
wp: walk probability 
Ensure: C: candidate solution 
1: improvingSteps = 
2: bestValue = v{C) 
3: C^C 

4: while improvingSteps < steps do 

5: improvingStepsH — h 

6: if neighOP == SPLIT/MERGE then 

7: compute the neighborhooud N{C) = Ng/^{C) of C 

8: else 

9: compute the neighborhooud N{C) ~ Ns{C) of C 
10: u = rand([0,l]) 
11: if w > wp then 
12: randomly select C' from N{C) 
13: else 

14: I{C) = {S e NiC)\v{S) <v{C)} 
15: if /(C) ^ then 
16: randomly select C from /(C) 

17: else 

18: select C from N{C) such that V5 £ N{C) : v{C') < v{S) 

19: C = C 

20: if v{C) < bestValue then 
21: bestValue = u(C) 
22: improvingSteps ~ 
23: C = C 
24: return C 



In order to evaluate the proposed algorithms, we implemented them in the C 
language and the corresponding source code has been included in the ELK systerrQ. 
ELK includes also our implementation of the algorithm proposed by Sandholm et 
al. in [25], DP |29], IDP [tlj, and IP [22. 

All the following experimental results about the behaviour of both GRASP and 
its extension with path-relinking are obtained executing the algorithms included in 
ELK on PC with an Intel(R) Core(TM) 15 CPU 670 @ 3.47GHz and 8GB of RAM, 
running GNU/Linux kernel 2.6.32-25-server. 

The first evaluation, whose results are reported in Figure [3] and Table [1] regards 
the behaviour of GRASP adopting the SPLIT/MERGE (TV^/m) or the SHIFT (N^) 
neighbourhood relation in the local search phase. We set the number of agents to 
15, the walk probability of the RII procedure to 0.7, and a cutoff run-length to 
10^ operations. In particular, for each instance of the problem we computed the 
solution quality obtained with GRASP, computed as the ratio between the optimal 



ELK is a system including many algorithms for the CSG problem whose source code is publicly 
available at http: //www.di .uniba. it/^ndm/elk/. 
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Table 1 . Descriptive statistics for the RLDs shown in Figure [3l 
vc = stddev / mean denotes the variation coefficient, and go. 75/90. 25 
the quantile ratio, where qx denotes the x-quantile. 



sohition value and the GRASP best solution value. The cutoff run-length limited the 
number of operations of the GRASP algorithm. In particular, GRASP ends or when 
the solution quality is 1 or when the number of computed operations is greater then 
the cutoff run-length. The operations taken into account are the sum of the visited 
nodes during the construction and local search phase. Hence, maxiter is set to 
-|-oo and the GRASP stopping criteria is based on the solution quality and number 
of operations. 

We generated 100 problem instances for each distribution type (Uniform, Uni- 
form scaled. Normal, Normal scaled and Normally distributed) of the characteristic 
function. For each instance 10 different runs of the GRASP algorithm were executed. 
Figure [3] plots the graphs of the RLD for GRASP about each distribution of the char- 
acteristic function. Each graph reports the curves corresponding to the cumulative 
empirical run-length distribution when the local search uses the SPLIT/MERGE 
and SHIFT neighbourhood operators. As we can see the SPLIT/MERGE neigh- 
bourhood operator is more robust than the SHIFT operator and permits GRASP to 
find good solutions more quickly. 

An insight view of the experiment is reported in Table [U where some descriptive 
statistics for the RDLs shown in Figure|3]are indicated. The first column reports the 
adopted distribution for the characteristic function (Uniform (U), Uniform scaled 
(US), Normal (N), Normal scaled (NS), and Normally distributed (ND)); the second 
column indicates the used neighbourhood operator; mean, min, max and stddev 
indicate, respectively, the mean, the minimum, the maximum and the standard 
deviation of the operations number over the 1000 runs (10 different GRASP execution 
for problem instance); vc denotes the variational coefficient {vc = stddev / mean); 
go. 75/90. 25 is a quantile ratio; and #opt is the number of runs GRASP found the 
optimal solution within the cutoff run-length operations. From these statistics 
becomes more evident the improvement obtained adopting the SPLIT/MERGE 
operator in the local search phase. 

A second experiment was run in order to evaluate the impact of the walk prob- 
ability value in the Randomised Iterative Improvement procedure adopted as local 
search. The corresponding results are plotted in Figure |4l As in the previous ex- 
periment we run GRASP 10 times over 100 problem instances for distribution and 
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(a) Uniform 



(b) Uniform scaled 




(c) Normal 



(d) Normal scaled 



(e) Normal distributed 

Figure 3. Semi log- plot of RLDs for GRASP, with SHIFT and 
SPLIT/MERGE neighbourhood operators, applied to 100 Uni- 
form, Uniform scaled. Normal, Normal scaled. Normal distributed 
CS instances, based on 10 runs per instance. 



taking fixed the neighbourhood operator to SPLIT/MERGE. The best values are 
obtained with a walk probability equal to 0.6 or to 0.7. There is an increasing 
improvement for values ranging from 0.2 to 0.6/0.7; then the quality of the found 
solutions starts to decrease for values ranging from 0.7 to 0.95. 
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Figure 4. Semi log-plot of RLDs for GRASP with 
SPLIT/MERGE neighbourhood operator, applied to 100 Uniform, 
Uniform scaled, Normal, Normal scaled, Normal distributed CS 
instances, based on 10 runs per instance with different values of 
the walk probability for the RII procedure. 



6. GRASP WITH PATH-RELINKING FOR THE CSG PROBLEM 

Path- relinking is an intensification strategy, proposed in |10| . that explores tra- 
jectories connecting elite solutions obtained by tabu search [9] or scatter search [21] . 

Given a set of elite solutions, paths among elite solutions in the solution space 
are generated and traversed hoping to visit better solutions. Paths are generated 
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Algorithm 4 PATH-RELINKING 

Require: Xs and xt'- starting and target solution such that f{xs) < f{xt) 
Ensure: best solution x* in path from Xg to Xt 

2: V = A{x,Xt) 
3: while 2? 7^ do 

4: select m* e V such that /(x © m*) < f{x®'mi) for aU m* rrii eV 

5: V = V\ {m*} 

6: X ~ X (B m* 

7: if f{x) < f* then 

8: /* = fix) 

9: X* = X 

10: return x* 



taking into account the moves incorporating attributes of the guiding solution into 
the current one. Algorithm |4] reports the path-relinking procedure applied to a 
pair of solutions Xs (starting solution) and Xt (target solution), assuming that 
f{xs) < f{xt), where / is the heuristic function computing the solution's value. 

The algorithm iteratively computes the symmetric difference A(x,Xt) between 
the current solution x and the target one Xt corresponding to the set of moves 
needed to reach xt from x. At each step, the algorithm considers all the possible 
moves 771 G A{x,Xt) and selects the one which result is the least cost solution, 
i.e. the one which minimises f{x ® m), where a; © m is the solution resulting from 
applying move m to solution x. The best move m* is made, producing solution 
X © m*. The algorithm terminates when xt is reached, i.e. when A(x, Xt) = 0, and 
returns the best solution x* obtained among the iterations. 

Given two elite solutions a and b, some of the alternatives to relink a and b p3l[24] 
considered in this paper are: 

• forward relink: using Xs = mina,6{/(a), /(6)} and Xt = maxa,&{/(a), /(6)}; 

• backward relinking: adopting a; s = maxa,b{/(a), /(&)} and Xt = mina_b{/(a), /(6)}; 

• back and forward relinking: both different forward and backward trajecto- 
ries are explored. 

Path-relinking represents a major enhancement to the basic GRASP procedure, 
leading to significant improvements in solution time and quality, firstly proposed 
in |15j . The path-relinking intensification strategy adopted in this paper is applied 
to each local optimum obtained after the local search phase. 

The algorithm adopts a pool of maxElite elite solutions that is originally empty. 
Then, each locally optimal solution obtained by a local search is considered as a 
candidate to be inserted into the pool if it is different from every solution currently 
contained into the pool. The strategy adopted in this paper is the following. If the 
pool already has maxElite solutions, the candidate is inserted into the pool if it is 
better than the worst of them, that is then removed from the pool. If the pool is 
not full, the candidate is simply inserted. 

Algorithm [5] reports GRASP with path- relinking for the CSG problem, referred 
in the following as GRASP+PR, where a new step to the construction and local search 
phase is inserted. The path-relinking algorithm is applied to the solution returned 
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Algorithm 5 GRASP+PR 



Require: v: the characteristic function; 
A: the set of n agents; 
maxlter: maximum number of iterations; 
neighOP: neighbourliood operator; 
maxElite: max pool dimension; 

riiSteps: max non improving search steps for the RII procedure; 
wp: RII walk probability 
Ensure: solution C £ M{A) 
1 

2 

3 

4 

5 

6 

7: 

8 

9 
10 
11 
12 
13 
14 
15 
16 
17 



iter = 

P = % 

while iter < maxiter do 

C = GreedyRandomisedConstructionO /* lines 4-13 of the Alg. H * 
C = Randomisedlterativelmprovement (C, wp, riiSteps, neighOP) 
if iter > 1 then 
for all x G P do 

determine which (C or x) is the initial and which is the target 

= PathRelinking(xs, Xt) 
update the elite set with Xp 

if v{xp) > v{C) then 
C — Xp 

update the elite set P with C 
else 

insert C into the elite set P 
iter = iter + 1 
return C 



by local search and to all the solutions from the pool. Improving solutions along 
the tajectories are considered as candidates for insertion into the pool. 

6.1. GRASP with path-relinking evaluation. The first part of the evaluation 
of GRASP+PR for CSG regarded its efficacy by varying the relinking strategy and 
the size of the pool of elite solutions. In the first experiment we investigated the 
GRASP+PR efficacy by varying the relinking strategy. Figure [5] plots the graphs 
corresponding to the RLDs for GRASP+PR adopting the SPLIT/MERGE neighbour- 
hood operator and different relinking strategies. The setup of the experiment is 
the same as that used in the previous GRASP evaluations, reported in Section [521 
100 problem instances for each distribution and 10 GRASP+PR runs per instance; 
the number of agents was set to 15, the walk probability of RII to 0.7, and the 
cutoff run-length to 10'' operations. For each instance the solution quality obtained 
with GRASP+PR has been computed as the ratio between the optimal solution value 
and the GRASP+PR best solution value. As we can see from the graphs reported 
in Figure E] the FORWARD/BACKWARD strategy is the most robust for all the 
distributions, while the FORWARD strategy seems to be the less beneficial one. Ta- 
ble [2] reports a descriptive statistics for the search space nodes visited by GRASP+PR 
with a FORWARD relinking strategy, where #nodes is the total number of visited 
nodes; #construction, #local and relinking are, respectively, the number of 
nodes visited in the construction, local search and relinking phase; iter represents 
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Figure 5. Semi log-plot of RLDs for GRASP-PR, with 
SPLIT/MERGE neighbourhood operator and FORWARD 
(F), BACKWARD (B), and FORWARD-BACKWARD (FB) 
relinking strategy, applied to 100 Uniform, Uniform scaled. 
Normal, Normal scaled, Normal distributed CS instances, based 
on 10 runs per instance. 



the mean value of the iterations required by the GRASP+PR algorithm to end a single 
run. 

In the second experiment of GRASP+PR we evaluated its efficacy by varying the 
size of the pool of elite solutions. Adopting the same setting of the last experiment, 
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Table 3. Descriptive statistics for the RLDs shown in Figure [Tj 
vc = stddev / mean denotes the variation coefficient, and (70.75/90.25 
the quantile ratio, where Qx denotes the a:-quantile. 



we fixed the FORWARD rehnking strategy and let to range the maxElite parameter 
from the values belonging to the set {10, 50, 100, 250, 500}. Figure H] plots the 
obtained RLDs showing that, for each distribution, the adoption of a large maxiter 
value let GRASP+PR to quickly find the best solution. 

The third experiment compared the behaviour of GRASP with respect to that 
of GRASP+PR by considering solution qualities and the runtime performances. The 
setting of the experiment is the same: 100 problem instances for each distribution 
and 10 GRASP+PR runs per instance; the number of agents was set to 15, the walk 
probability of RII to 0.7, the cutoff run-length to lO'^ operations, the relinking 
strategy was FORWARD, the neighbourhood operator was SPLIT/MERGE, and 
the pool size of the elite solutions to 10. Figure [7] plots the obtained RDLs and the 
Table [3] reports the corresponding descriptive statistics. 

Finally, even if GRASP+PR is not a complete algorithm returning the optimal 
solution, we compared it to the two best performing algorithm able to return the 
optimal solution. Given different numbers of agents, ranging from 10 to 18, we 
compared GRASP+PR to IDP and IP reporting the time required to find the optimal 
coalition structure. As reported in Figure [3 where the time in seconds is plotted 
in a log scale, GRASP+PR outperforms both IDP and IP for non scaled distributions. 
Its efficacy is comparable to that of IDP and IP on the scaled distributions. 
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Figure 6. Semi log-plot of RLDs for GRASP+PR with 
SPLIT/MERGE neighbourhood operator and FORWARD 
relinking strategy, applied to 100 Uniform, Uniform scaled. 
Normal, Normal scaled, Normal distributed CS instances, based 
on 10 runs per instance. Curves refer to different values for the 
size of the pool of elite solutions. 



7. Implementation details 

7.1. The characteristic function. Concerning the representation of the charac- 
teristic function and the search space, given n agents N = {ai, 02, . . . , On}, we 
recall that the number of possible coalitions is 2" — 1. Hence, the characteristic 
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Figure 7. Semi log-plot of RLDs for GRASP and GRASP+PR 
with FORWARD relinking strategy and SPLIT/MERGE neigh- 
bourhood operator, applied to 100 Uniform, Uniform scaled, Nor- 
mal, Normal scaled. Normal distributed CS instances, based on 10 
runs per instance. 



function v : 2" R is represented as a vector CF in the following way. Each subset 
5 C j4 (coalition) is described as a binary number cb ~ &i&2''-&n where each 
hi = 1 if fli G 5, hi = otherwise. For instance, given n = 4, the coalition {02,03} 
corresponds to the binary number 0110. Now, given the binary representation of 
a coalition 5, its decimal value corresponds to the index in the vector CF where 
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Figure 8. Semi log-plot of run times for GRASP-PR, with 
SPLIT/MERGE operator and FORWARD relink type, and IP ap- 
plied to 10 Uniform, Uniform scaled. Normal, Normal scaled. Nor- 
mal distributed CS instances, based on 10 runs per instance. 

its corresponding value v{S) is memorised. This gives us the possibility to have a 
random access to the values of the characteristic functions in order to efficiently 
compute the value v of a coalition structure. 

7.2. Coalition structure. Given a coalition structure C = {Ci, C2, ■ ■ ■ , Ck}, as- 
suming that the d are ordered by their smallest elements, a convenient representa- 
tion of the CS is an integer sequence did2 ■ ■ ■ dn where di — j, if the agent Ui belongs 
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to the coalition Cj. Such sequences are known as restricted growth sequences |18| 
in the combinatorial literature. The binary representation of the coalition d is 
^1^2 ■ • ■ &n where hj = if dj ^ i, and bj = 1 otherwise. 

For instance, the sequence corresponding to the coalition structure C ~ {Ci, C2, C3} = 
{{1, 2}, {3}, {4}} is 1123. Now in order to compute v{C), wc have to solve the sum 
v{Ci) + v{C2) + ^(Cs), where Ci corresponds to the binary number 1100, C2 cor- 
responds to the binary number 0010, and C3 corresponds to the binary number 
0001. Hence, v{C) = v{Ci) + v{C2) + viCs) = CF[11002]+ CFCOOlOal+CFCOOOlal 
= CP [12] +CF [2] +CF [1] , where CP is the vector containing the values of the char- 
acteristic function. 

8. Conclusions 

The paper presented an algorithm applyable to cooperative complex problems 
that require to find an optimal partition, maximising a social welfare, of a set of 
entities involved in a system into exhaustive and disjoint coalitions. We present a 
greedy adaptive search procedure with path-relinking to efficiently search the space 
of coalition structures of those gouping problems. As reported in the experimental 
section the proposed algorithm outperforms in some cases the state of the art 
algorithms in computing optimal coalition structures. 
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